ON NEW SUM-PRODUCT TYPE ESTIMATES 



MISHA RUDNEV 

Abstract. New lower bounds involving sum, difference, product, and ratio sets 
for A C C are given. 



1. Introduction 
Erdos and Szemeredi ([3]) conjectured that if yl is a set of integers, then 

where 

A + A = {ai + a2: ai,2 G A} 

is called the sum set of A, the product A ■ A, difference A — A, and ratio A : A 
sets being similarly defined. (In the latter case one should not divide by zero.) 
The notations « are being used throughout to suppress absolute constants 

in inequalities, the symbol o(l) in exponents absorbs logarithmic factors in the 
asymptotic parameter |^|, the cardinality of A. 

Variations of the Erdos-Szemeredi question consider the set A living in other rings 
or fields, as well as replacing, e.g., the sum set with the difference set A — A. The 
conjecture is far from being settled, and therefore partial current "word records" on 
it vary with such variations of the input. 

The best result for reals, for instance, is due to Solymosi ([H]), claiming 

(1) \A + A\ + \A ■ A\ ^ \A\^+^-°^^\ 

and would include the endpoint exponent ^ if A ■ A were replaced by A : A. 

However, the construction is specific for reals, and does not appear either to 
extend easily to the case ^ C C, or to allow for replacing the sum set A + A with 
the difference set A — A. So, if A £ C or if A + A for reals gets replaced by ^ — ^, 
the best known result comes from an older paper of Solymosi ([E]), claiming 

(2) \A±A\ + \A-A\:^\A\^+Ti~°W, 

also without the o(l) term if A ■ A gets replaced by ^ : ^. 

It may be worth mentioning that the result ([2]) is based on the - rather sophis- 
ticated - Szemeredi- Trotter theorem, while ([T]) is not, using just elementary order 
properties of positive reals, expressed in the fact that if | < |, then falls in 
between. This note is almost entirely based on the Szemeredi- Trotter theorem and 
succeeds in slightly improving on ([2]), yet not enough to beat ([T]). 
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Theorem 1. For any A <Z C, with two or more elements one has 



(3) 



2. Lemmata 

The main tool behind the above estimates is the Szemeredi- Trotter incidence 
theorem. For a set P of points and a set of L straight Hnes in a plane let 

I(P,L) = {(p,OePxL: pel] 

be the set of incidences. 

Theorem 2 (Szemeredi and Trotter [16]). The maximum number of incidences in 
'S? is hounded as follows: 

(4) |I(P,L)| «(|P||L|)I + |P| + |L|. 

In particular, if Pt (or Lt) denote the sets of points (or lines) incident to at least 
t>l lines (or points) of L (or P), then 

\Pt\ < 

(5) 

\Lt\ < 

Let us note that the linear in |P|, \L\ terms in the estimates (j31[5]) are essentially 
trivial and usually of no interest in the sense of being dominated by the non-linear 
ones, whenever these estimates are being used. This is also the case in this paper. 

The Szemeredi- Trotter theorem is also true in the plane over C. This was proved 
by Toth ([I3)- Recently there has been a new proof by Solymosi and Tao ([IS]) 
which had to sacrifice the endpoint in the exponent for the sake of elegance of the 
method. 

One can easily develop a weighted version of the Szemeredi- Trotter theorem, see 
losevich et al. ([5J). Suppose each line / G L has been assigned a weight m{l) > 1. 
The number of weighted incidences im{P,L) is obtained by summing over the set 
I{P,L), with each pair {p,l) being counted m{l) times. Suppose, the total weight 
of all lines is W and the maximum weight per line is m. It is argued in [5] that 
the worst possible case for the weighted incidence estimate is the uniform one, when 
there are ^ lines of equal weight m, hence the following theorem. 

Theorem 3. The maximum number of weighted incidences between a point set P 
and a set of lines with the total weight W and maximum weight per line rh is 

(6) im{P,L) <^fhri{\P\W)ri +fh\P\+W. 
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This paper uses in its core the same geometric construction as Solymosi ([13j) did, 
which yielded the exponent with some more detailed analysis of the incidences 
involved by dealing with the weighted case. Yet the improvements over ([2]) are due to 
combining this construction with a recent purely additive-combinatorial observation 
by Shkredov and Schoen ([9], Lemma 3.1) which allowed for a series of the latest 
state-of-the art improvements in incremental progress towards a number of open 
questions in field combinatorics in [9], |12] . [lOj . 

The above-mentioned additive-combinatorial observation is the content of the fol- 
lowing LemmalU quoting which requires some notation used in the sequel. Through- 
out the rest of this section A, B denote any sets in an Abelian group. The following 
energy notations E, when applied in a field will bifurcate into E and E^:, respectively, 
relative to the addition and multiplication operations. 

For any d £ A — A, set 

(7) Ad = {a e A : a + d e A}. 
Denote 

E{A,B) = |{(ai, 02, 61,62) eAxAxBxB: 01-02 = 61- 62}!, 

referred to as the "additive energy" of A,B. By the Cauchy-Schwarz inequality, 
rearranging the terms in the above definition of E{A, B), one has 

(8) E{A,B)\A±B\>\A\'^\B\'^. 

Indeed, if d is an element of vl — i? or s is an element of A + B, and n{d), n{s) are 
the number of realisations of d and s, respectively as a difference or sum of a pair 
of elements from A x B , ^ follows from the fact that 

(9) E{A,B)= ^Hd)= E ^'(^)- 

d€A-B seA+B 

Also, E{A,A) = E{A) is referred to as the "energy of A". In this case note that 
according to the notation ([7]), n{d) = \Aci\. 

Moreover, the following inequality will be useful. If 

(10) D+ = l^deA-A:n{d)>^j^j-^ 

then "the energy supported on D'^" , namely the left-hand side of the next inequality, 
satisfies 

(") E"^M»^- 

deD+ ' ' 

Indeed, by ([8]) with A = B, the energy supported on the complement of is 
trivially bounded from above by the right-hand side of (jlip . 
Also useful will be the "cubic energy" of A, which is 

(12) ^3(A) = |{(oi,...,a6) G ^ X ... X ^ : 01-02 = 03-04 = 05-06}]. 
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This definition implies that (^IQ], Lemma 2) 

(13) Es{A)= E{A,Ad). 



d&A-A 



To see this, fix any d = ai — 02 in (112p and observe that if one is to count every 
representation d = 03 — 04 as many times as 03, 04 S Ad for some d, this will happen 
exactly n{d) times, for different d = 05 — 03 = og — 04 in (fT2]) . 

The following statement is the content of Corollary 3 in [lOj, with trivial varia- 
tions. 

Lemma 1. One has the following identities, for any D' ^ A — A: 



Y,\Ad\\A±Ad\> — 



2 



d&D' ' 

(14) 

y \Ad^\A ± Ad\ > (^^^D'\Ad\')\ 

\ d\ \ d\ - ^ 

d&D' ' 

Proof. To verify the first inequality of (jl6p observe that by the Cauchy-Schwarz 
inequality, for each d: 



^\Ad\\A±Ad\y^EiA,Aa) > \A\\A, 



Summing over d G D' and applying once again the Cauchy- Schwartz inequality to 
the left-hand side yields 



\Ad\\A±Ad\ y^EiA,Ad) > \A\ ^ \A, 
deD' V deD' deD' 



squaring both sides and using (|13p does the job. Verifying the second inequality in 
(jl4p requires merely a straightforward modification of the above. □ 



The identities (jl4l) suggest that the cubic energy estimate from above can be quite 
useful, since A - A^ {A - A) n {A - A - d), as well as A + A^ C {A + A) n {A + 
A + d). The latter observation, which [lOj credits to Katz and Koester ([6j), allows 
for the following interpretation of (jl4p . producing lower bounds for E{A,A — A) 
and E{A, A + A). Indeed, the left-hand side in ([T]) itself is a lower bound for both 
E(A, Aziz A). However, in order to be able to express it in terms of the sum set, via 
([9|), one needs to deal with the sum of l^^l^ in the right-hand side, rather than of 

In the first formula of (|14p assume that D' is a popular subset A — A, namely 
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Then 



In the second formula of ()14p replace D' with D"^, defined by (jlOp . Then 



2 

2 



(16) 



\A-A\E3iAy 
\A\ 



\A + A\^E3{A) maXdgB+ \Ad\' 



2.1. Some applications of Lemma [H The estimates (jl6p enabled Schoen and 
Shkredov (^J) to achieve progress on the sum set of a convex set problem. If 
A = f([l,...,N]), where / is a strictly convex real- valued function, then 1^4 — 
^1 » |^|t~°*^-'^\ with a slightly worse estimate |^ + ^| » |A|"9"~°(i) for the sum- 
set. The reason for the two estimates being different is that dealing with the sum 
set necessitated bootstrapping the earlier established exponent |, which had been 
obtained over the past ten years or so in various guises, with or without using the 
Szemeredi-Trotter theorem. (See e.g. [2], [3], [5]. The conjectured exponent in the 
convex set sum set problem is2 — o(l).) 

Li ([Z]) - see also his recent work with Roche- Newton ([8]) - has adapted the 
approach of [10] to the sum-product problem, using a specific convex function, the 
exponential. This was also observed in another paper of Schoen and Shkredov 
(|11]. Corollary 25). The result was the exponent y| — o(l). A closer look at this 
adaptation - see the Appendix in this paper - reveals that the exponential function 
has as much do with it as basically replacing a with exp(loga) in a variant of the 
old sum-product construction by Elekes (p^): applying the Schoen-Shkredov trick, 
expressed in (fT^ to the estimate (HTj) in the Appendix would improve the Elekes 
exponent | to y| — o(l). 

The same exponent ^ — o(l) had been coincidentally obtained in Solymosi's work 
[H], as stated in ([2]) above. 

This note uses the construction of pL4j and combines it with an estimate of the 
type (jl4p . thereby getting an improvement ([3]) over ([2]). The estimates involving 
the sum set end up being worse, because they require bootstrapping the estimates 
which themselves would yield the exponent ^ — o(l) only. It will also use the Elekes 
construction in order to channel the estimates involving the ratio set into ones using 
the product set. These will have to bootstrap the Elekes exponent |, thus making 
the estimates involving the product set still worse. 

3. Proof of Theorem [T] 

The condition |^| > 2 is tantamount to assuming that ^ A. Consider a point 
set A X A in the coordinate plane. 

The multiplicative energy of A is defined as the number of solutions of the 

following equation 

E^{A) = |{(ai, . . . 04) G ^ X . . . X ^ : 01/02 = 03/04}!. 
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By the Cauchy-Schwarz inequality 
(17) E4A)>maxl ' ' ' ' 



Geometrically, E^{A) is the number of ordered pairs of points of ^ x A in the plane 
on straight lines through the origin, whose slopes r are members of the ratio set 
A : A. Hence, elements of A : A can be identified with lines through the origin 
supporting points of A x A. 

The proof will deal with an in some sense "popular" subset L of these lines, P 
denoting a subset of A x A supported on the lines in L. There will be two cases to 
consider. 

Ratio set case. In order to establish the first two estimates of ([3]) the notation L 
will stand for the set of popular lines through the origin, namely those supporting at 
least ^I^I^IA : A\ points of A x A. The subset of yl x ^ supported on these lines L 
will be denoted as P. One has \P\ ^ as well as \A\ < \L\ < \A : A\. Let us also 
use the notation N for the maximum number of points per line, clearly < \A\. It 
follows from (|17p that if n(/) denotes the number of elements of P supported on a 
line I G L, 

(18) Y."'W»W^r 

Product set case. In order to establish the last two estimates of ([3]), the same 
notations P, L will apply to slightly differently defined sets, as well as the quantity 
A^, as follows. The set P will be a "popular multiplicative energy" subset of A x A, 
constructed by the standard dyadic pigeonholing procedure by popularity, in terms 
of supporting points of A x A, of all lines through the origin. The elements of P 
are those points in A x A which lie on a set L of straight lines passing through the 
origin, supporting between N/2 and points each, and such that 

(19) 1^1^' > I. [^^^ I.I - 

|^-^|log|A| 

Such sets L,P always exist, by the pigeonhole principle and (fT7|) . 

Important in the product set case will be the following bounds on A^. 

Lemma 2. There exists L,N satisfying U9\) and such that 



, , 1 \A? \A±A\^\A-A\ 

The lower bound in (j20p follows right away from (jl7p and a popularity argument. 
The upper bound comes from a variant of the Elekes construction ([Tj) apropos of 
sum-products. A variant of Lemma [2] can be found in the above-mentioned works 
[8], [TT]. A short proof is given in the Appendix. 

In both the ratio and product set cases, it can be assumed that \L\ ^ A^. It is 
clear in the ratio set case, where A^ < |^| and thus, by ([TS]) . \L\ ^> > \A\. 
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In the product set cases |P| ^ \L\]^- Assume that » \L\. Since A*" satisfies (i20]l 
and LN'^ is bounded from below by ()19p . it fohows that 

\A±A\^\A- A\^ :^ ' ' 



log|A|' 

which is far better than the last two claims in ([3]). 

Beginning now the proof proper, in either of the two cases above consider the sum 
set of P with some other set Q in the plane, with \Q\ > \P\. (In the sequel Q = ±P 
or P lb P). To obtain the vector sums, one translates the lines from L to each point 
of Q, getting thereby some set C of lines with \C\ < |-L||Q|. 

The Szemeredi- Trotter theorem, namely (jlj), enables one to estimate \C\ from 
below: 

\L\\Q\^\C\I\Q\I. 

(It is easy to see that the fact that since \L\ < \Q\ < \C\ < \L\\Q\, trivial terms in 
this application of ([4]) can be ignored.) Thus 

(21) |£| > |L|f|g|l. 

Let us call the number of points of Q on a particular line I £ C, the weight m{l) 
of I. The total weight W of all lines in the collection £. is by construction equal to 

I^IIQI- 

The lines in C have been given weights, because the same line I G C can contribute 
to the same vector sum in P + Q at most max(A^, m(Z)) times. Hence, let us lower 
the weights of lines, which are "too heavy": whenever m{l) > N, redefine it as A". 

Therefore, for the total weight W and the mean weight rh per line, from (I2ip one 
has 




(22) W<\L\\Ql rh-- 

The Szemeredi- Trotter theorem, namely ([5]), tells one that the weight distribution 
over C obeys the inverse cube law. I.e., for t < N, one has 

(23) \Ct\ = |{/ G £ : m{l) > f}| « 1^ + M « '^1' 



as since A^ <^ \/\Q\j the trivial term gets dominated by the first term. 

Now, let us look at the set V{C) of all pair- wise intersections of lines from C, and 
for an intersection point p £ P(£) of some k > 2 lines li, . . . ,1/. look at the sum of 
the weights of the lines that intersect there: 

k 
i=l 

For any point set V C V{C), the number of weighted incidences between V and C 
is the sum over all pairs {p.,1) G I{V,C), counting each pair {p,l) with the weight 
m{p). 



8 



MISHA RUDNEV 



The inverse cube weight distribution over the set of hnes C, enables one to use 
the Szemeredi- Trotter theorem rather efficiently for counting weighted incidences, 
similar to how it was done in the paper of losevich et al. (See jS], Lemma 6). 

Lemma 3. Suppose, \Q\ > \P\, and the weights of lines in C have been capped by 
N. For X €z P + Q, let n(x) be the number of realisations of x as a sum. Then for 
t: N <^t< \P\, 

iLlf IQlt 

(24) \{x(^P + Q: n{x)>t}\ < I I 



t3 ' 

Proof. The condition \Q\ > \P\ S> N'^ (which holds in both the ratio and product 
set cases) ensures that (j23p is valid, since for all I G £, m(l) < N. Observe that for 
any point set V, the number of weighted incidences of C with V can be bounded 
from above using dyadic decomposition of C by weight in excess of m, via 

log2 Af-log2 fh 

(25) l<^im{V,Cfn)+ Yl (2^m)|I(P, £2,^)1, 

i=i 

where Cm denotes the set of lines from C whose weight is at most fh, m denotes 
the dyadic group of lines whose weights are approximately 2-'m, and the notation 
im lias been introduced in the context of Theorem [3l 

One uses the Szemeredi- Trotter theorem to estimate each number of incidences 
\I(V,C2jrh)\? for j > involved and its weighted version for the first term. The 
condition (j23p ensures that the weighted Szemeredi- Trotter estimate ^ for the first 
term in (I25p dominates as follows. For j > 1, one has 

(26) 1/(7^,^.^^)1 « i\n^2^fn\f' +\'P\+\^2^rn\- 

For the case j = 0, Theorem is used. 

(27) imiV,Cm) <.m^\V\W)^ +fh\V\+W. 

Together with the rest of the bounds for each |£2Jml coming from ([23]) this yields 

(28) I ^m^\V\W)i +N\V\+W. 

(See (jSip below where the bounds in (j26p for the terms for j > 1 are spelled out 
explicitly.) 

Observe now that for x £ P + Q CI V{C), with n[x) denoting the number of 
realisations of a sum set element x, one always has n{x) < m{x), where m{x) is the 
weight of member of V{C). 

Applying (128]) to the point set 

(29) Vt = {xeP + Q: n{x) > t}, 

with the lower bound t\Vt\ for I, one sees that for t ^ N the term N\V\ in the 
right-hand side of (128p cannot possibly dominate the remaining terms. Hence for 
t ^ N one has 

(30) 
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and has to be slightly more careful with the trivial term in the bound ()30p and 

refine it, so that it becomes absorbed into the first term to get (j24p . This is clearly 
the case for t < y^\L\\Q\^, but not yet for higher t. 

Let us now address the issue of large t. The lines in C come in \L\ possible 
directions, and therefore no more than \L\ lines can be incident to a single point. 
Hence, lines from a dyadic group >C2Jm do not contribute to sets Vt, whenever t ^ 
\L\ ■ {2^m). This means that one needs, with (j23p in view, only a "tail" estimate for 
the right-hand side of (j25p . with t ~ \L\(2^m): 

log^N-iog^rh \n\l\v\i ini2 

(31) V (2-m)|7(P,,A.^)|« +N\Vt\+ 



i=j 

It follows that, since N <^t < \P\, 

Since \Q\ > \P\ > \L\, the first term in the last estimate dominates the second term, 
thus proving (|24p . □ 

Let us now apply the estimate ()24p with Q = itP; of these two the case Q = —P 
will serve to estimate E^{P). 

The assumptions of Lemma [3] are satisfied, and therefore 

21og2|A| 

(32) ^3(^)= n\x)^N^\P\'' + \L\l\P\l ^ « |L| § |P| § log 

xeP~p j=i 



since in both the ratio and product set cases A^^ <^ y^\P\\L\^. 

Using this, the first formula in (|16p . with A replaced by P, yields: 



\P\f 

(33) E{P,P-P)^ 3 U . 

|L|^|P-P|log|A| 

To get a similar estimate for the sum set P + P, let us use the second formula 
in (|16p . with A replaced by P and D = P — P. To estimate the quantity t = 
max^g£)+ \Pii\ it suffices to observe that part of the energy E{P) supported on 
is bounded from below by (jlip . where A gets replaced by P, and from above, by 
Lemma [3l as 

/ , , 3 , , 5 \ 

O N\P\^ + 



t I 

It follows that unless |P + P| ^ which is much better than ([3D, one has 

\L\hp + P\ 

(34) max \Pd\ <C ' ' ' 3 ^• 



d€D+ \p\2 
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Then, using the second estimate in (fT6]) and (i32]l . one gets 
(35) ^(P,P + P)» 



L|3|P + P|3log|^|' 

On the other hand, one can use Lemma [3] with Q = P ziz P and estimate the 
quantity E{P, P ziz P) from above. Then for any t ^ N: 

, , 3 , ,5 
L 2 P lb P 2 

(36) E{P,P±P) <^\P\\P±P\t+ ^ ' ' ^ ^, 

and choosing 

|P±P|t|L|t 
t = ±J ' > A^, 

to match the two terms in (j36p yields 

(37) £;(P,P±P) < vl^l-Pi-Pl^l^- 
Combining this with (|33p results in 

,11, , a IPP 
38 P-P-L4>.^^. 

log 1^1 

To obtain the first estimate of ([3]) it suffices to note that one has (with what the 
notations P, L stand for in the ratio set case) |P| ^ 1-^1 ^ |^ : ^|) plus 

|P-P| < \A-A\^. 

To prove the second estimate in ^ the estimates ([ST]) and ([35]) get put together. 
It follows that 

I 7->l 2i 

, 19 , ,15 P 4 

(39) |P + P|-|L|-> 



log 1^1 



Using |L| < 1^ : ^1, |P| » and |P + P| < |yl + Ap results in the second estimate 
in ©. 

In the product set case, where |P| = \L\N, the estimates (138 ] 1391) become 



1^ - A\^ > , 
(40) ViVlogl^l 

,,19 (|L|iV2)ia 



iVlog |A| 



Lemma [2] now supplies a non-trivial upper bound on A^. Substituting the bounds 
from (fT9]) and ([20]) into (|l0]) then yields the last two estimates of dSj) and completes 
the proof of Theorem [TJ □ 
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3.1. Appendix. Proof of Lemma [2]. The lower bound of (I20p is merely the 
multiplicative version of the popularity argument behind (llOp . resulting in (jlip . 
Indeed, the multiplicative energy of A supported on those lines through the origin, 
which correspond to ratios r £ A : A, whose number of realisations n(r) > ^ \a^a\ ' 



is > 

On the other hand, it was proven in [8], |llj that the multiplicative energy coming 
from the lines through the origin, supporting at least points of A x A (that from 

the set of all ratios r £ A : A such that n(r) > A^), is O (J—^-^^- This, together 

with (|19p settles the upper bound in (|20p and proves Lemma [2l 

For completeness sake, a simple version of the proof of the upper bound in (I20p for 
N is given below. This bound was derived in [8], via the Szemereed- Trotter type 
estimates for convex functions, using a particular example of a convex function, the 
exponential. Let us show that the bound in question, in fact, represents a variant of 
the well known construction of Elekes apropos of sum-products, which gave the 
exponent | implicit in the bounds ()20p . The notation in the forthcoming argument 
is somewhat independent from the rest of the paper. 

Consider a set A, not containing zero and a set of lines L = {y = ^±^}^ where d 
is an element of the difference set A — A (or the sum set A + A, the modification 
required being trivial) and a £ A. Clearly there are \A — A\\A\ lines. Therefore, 
the number of points in a set Pt, where at least t < \A\ lines intersect is, by ([5]), 
bounded as 

, , \A-A\^\A\^ 

Given a point {x,y) G Pt, one has an intersections of at least t lines of L there, 
namely y = '^'^^ for at least t different pairs {di,ai). For each di take some fixed 
representation di = Ui — Vi, let a G A he a variable. Clearly di + x = (uj — a) + (a — 
Vi + x) = d[ + x'. I.e., Pt is such that if it contains one point (x, y), then it contains 
at least \A\ distinct points with the same ordinate. 

Conversely, if Rt is the subset of ratios from A : A which have at least t reali- 
sations, it is clearly a subset of the set of ordinates of the points from Pt. Indeed, 

if r = ^ = . . . = ^ , then it equals to the ordinate of a horizontal family of in- 
tersection points of at least k lines, identified by the pairs (di = a[ — a,ai), these 
intersection points having the abscissae x = a. 
It follows that 

(41) \Rt\ « 

and hence the multiplicative energy supported on Rt is O [ I . Compar- 



t 

ing this with the lower bound in terms of jjrj-^ gives the upper bound in (j20p . where 
t has been replaced by A^. □ 
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